Assessing and Improving the Quality of Knowledge Discovery Data

Author

  • Herna L. Viktor
Abstract

Data quality has a substantial impact on the results of a Knowledge Discovery from Data (KDD) effort. The poor quality of real-world data, as contained in many large data repositories, poses a serious threat to the future adoption of this new technology. Unfortunately, data quality assessment and improvement are often ignored in KDD efforts, leading to disappointing results. This chapter discusses the use of data mining and data generation techniques, including feature selection, case selection and outlier detection, to assess and improve the quality of the data. In this approach, redundant low-quality data are removed from the data repository and new high-quality data patterns are dynamically added to the data set. We also point out that data capturing is part of the social practices of office work, and this fact must be taken into account when designing data capturing processes.
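
To make the removal step concrete, here is a minimal sketch of two of the techniques the abstract names: a crude form of feature selection (dropping attributes that never vary) and outlier detection (dropping records that fall outside Tukey's interquartile-range fences). This is an illustration only, not the chapter's method. The function names, the `k=1.5` fence multiplier, and the sample records are all assumptions made for the example.

```python
from statistics import pvariance, quantiles


def constant_columns(rows, columns, min_variance=0.0):
    """Return columns whose population variance is at most min_variance."""
    return [c for c in columns if pvariance([r[c] for r in rows]) <= min_variance]


def iqr_outlier_rows(rows, column, k=1.5):
    """Return indices of rows whose value in `column` lies outside the Tukey fences."""
    values = [r[column] for r in rows]
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return {i for i, v in enumerate(values) if v < low or v > high}


def clean(rows, columns):
    """Drop non-varying columns, then drop rows flagged as outliers in any kept column."""
    dropped = set(constant_columns(rows, columns))
    kept_columns = [c for c in columns if c not in dropped]
    flagged = set()
    for c in kept_columns:
        flagged |= iqr_outlier_rows(rows, c)
    cleaned = [
        {c: r[c] for c in kept_columns}
        for i, r in enumerate(rows)
        if i not in flagged
    ]
    return cleaned, kept_columns


# Hypothetical records: "unit" never varies and one "age" value is implausible.
ages = [30, 32, 31, 29, 33, 30, 31, 250]
incomes = [50, 52, 51, 49, 53, 50, 51, 52]
sample = [{"age": a, "unit": 1, "income": inc} for a, inc in zip(ages, incomes)]

cleaned_rows, kept = clean(sample, ["age", "unit", "income"])
```

A real KDD pipeline would likely use more careful criteria (for example, relevance to the target concept rather than variance alone), and the abstract's second step, generating new high-quality data patterns, is not covered by this sketch.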

Similar resources

Designing an Ontology for Knowledge Discovery in Iran’s Vaccine

Ontology is a requirement engineering product and the key to knowledge discovery. It includes the terminology to describe a set of facts, assumptions, and relations with which the detailed meanings of vocabularies among communities can be determined. This is a qualitative content analysis research. This study has made use of ontology for the first time to discover the knowledge of vaccine in Ir...

Assessing the effectiveness of knowledge management using Analytic Network Process

Knowledge management in higher education is a set of organizational processes that support creating and transferring knowledge in these institutions and help achieve organizational and university objectives. Therefore, for the proper management of organizational knowledge, appropriate tools are needed to gauge the effectiveness of knowledge management in organizatio...

Assessing Environmental Literacy and its Relationship with Environmental Ethics

Background: Attention to environmental literacy and ethics in agriculture has a decisive role in nature conservation and production resources. If farmers do not have environmental literacy, irreparable damage to the environment is expected. The purpose of this study was to evaluate environmental literacy and its relationship with environmental ethics among farmers in Torbat-e-Heydariyeh. Metho...

Quality of Life and Its Associated Factors in Patients with Congestive Heart Failure

Background & Aim: Improving quality of life is generally one of the main goals in caring for patients with congestive heart failure, so identifying the factors that affect it is important. This study was conducted to determine the quality of life of these patients. Methods & Materials: 184 patients with congestive heart failure who were referred to clinics of Tehran University of Medi...

Investigating the effect of Interventions on improving the Service Quality of Physiotherapy Clinic in Rehabilitation Faculty of Tabriz in 2011-2012

Background & Objective: Quality is the main indicator in assessing health system performance, and service quality refers to the non-clinical aspect of health care. This study aims to survey and improve the service quality of care delivered in the physiotherapy clinic of the Tabriz rehabilitation faculty. Materials & Methods: The present study is an interventional one which collects the data from 324 p...

Application of Rough Set Theory in Data Mining for Decision Support Systems (DSSs)

Decision support systems (DSSs) are prevalent information systems for decision making in many competitive business environments. In a DSS, the decision-making process is closely tied to factors that determine the quality of information systems and their related products. Traditional approaches to data analysis usually cannot be implemented in sophisticated companies, where managers ne...

Publication date: 2016